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1 .  Introduction 

The  research  performed  on  optical  image  processing  falls  into  three  rather 
broad  categories:  image  coding,  image  enhancement,  anc  mage  detection.  Con¬ 
sideration  has  been  given  both  to  monochrome  and  color  i '  qes.  Also,  a  study  has 
been  undertaken  to  evaluate  the  implementation  requirmei  tr  of  one  of  the  more  pro- 
i  mising  image  processing  techniques. 

The  following  sections  summarize  the  results  of  some  of  the  research  performed 
in  this  study.  More  detailed  accounts  of  the  various  research  projects  are  to  bo  found 
in  the  published  papers  listed  at  the  er.d  of  the  section. 
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2.  Spatial  Transform  Coding  of  Color  Images 


Digital  image  transforms  have  been  applied  quite  successfully  to  obtain 
a  bandwidth  reduction  >  ad  tolerance  to  channel  errors  for  monochrome  images  [l-4]. 

The  potential  for  monochrome  image  bandwidth  compression  with  transform  coding 
arises  from  the  spatial  correlation  within  natu*al  images.  As  a  result  of  this 
spatial  correlation,  the  image  energy  within  the  transform  domain  tends  to  be 
clustered  toward  a  relatively  few  number  of  transform  samples.  Low  magnitude 
samples  may  either  be  discarded  completely  or  coded  with  a  few  number  of  bits 
without  seriously  effecting  image  quality.  Sample  reductions  on  the  order  of 
5:1,  compared  to  conventional  PCM  coding,  have  been  reported  for  transform 
coding  cf  monochrome  images  [l-3]. 

Preliminary  studies  have  indicated  that  the  spatial  redundancy  of  color 
images  and  the  limitations  of  human  color  vision  can  be  exploited  by  transform 
coding  to  achieve  a  substantial  bandwidth  reduction  for  the  digital  transmission 
of  color  television  and  facsimile  [4-8].  Figure  1  contains  a  block  diagram  ' 

■of  a  color  image  transform  coding  system.  In  the  system  the  color  image  is 
represented  by  three  source  tristimulus  signals  R(x,y),  G(x,y),  D(x,y)  that 
specify  the  red,  green,  and  blue  content  of  an  image  element  at  coordinates 
(x,y),  according  to  the  N.T.S.C.  receiver  phosphor  primary  system  [9]. 

The  source  tristimulus  signals  are  then  converted  to  a  new  three  dimensional  space  through 


some  linear  or  nonlinear.;  invertible  coordinate  conversion  process.  The  objective 
of  the  coordinate  conversion  is  to  produce  three  planes  of  data,  ^(y.y),  f2(x,y), 

f  (x,y),  to  be  called  color  signals,  that  are  most  amenable  to  transform  coding. 
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Next,  a  spatial,  unitary  transformation  is  performed  on  each  color  signal  plane 
resulting  in  three  transform  domain  planes  F^u.v),  F2ru,v),  F3(u,v).  Quantization 
and  coding  to  achieve  a  bandwidth  reduction  is  then  performed  on  the  three  transform 
domains.  At  the  receiver,  the  channel  output  is  decoded,  and  the  inverse  spatial 
transforms  and  inverse  coordinate  conversion  operations  are  performed  to  recon¬ 
struct  the  source  trist.mulus  signals.  3 


Color  Imarc  Transform  Corlinp 

Consider  the  color  image  transform  coding  system  of  Figure  1  in 
which  the  color  coordinate  conversion  is  linear.  The  three  color  signals 
planes  can  be  expressed  as 


i  1  =  nij  jR  +  +  m^]B 


f2  =  m21R  +m22G  +m2JB 


»  =  m3i£.  +  m32G  +m33B 


Each  of  these  color  signal  planes  is  then  separately  transform  ;d,  by 
perhaps  a  different  unitary  transform  (Ak^,  Ak  ),  to  produce  three  transform 
planes 


F 1  =  A1  fl  Al  =  m  Al  R  Al 

+  mnAl  G  A 1 

+  m.  Al  B  Al 

7 
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+  m  _A2  G  A2 

+  m  A2  B  A2 
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7 
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F3  =  A?yf3  A3x 

=  mo  ,  A3  R  A3 

J  1  - y - x 

+  m  _ A3  G  A3 
32 — y - x 

+  m  ,A3  B  A3 
33 — y - k 

Table  1  defines  several  unitary  transforms  that  have  been  examined  for 
color  image  transform  coding.  Trom  the  above  it  is  apparent  that  the  order 
of  the  color  coordinate  conversion  and  two  dimensional  forward  transfor¬ 
mation  processes  is  immaterial.  In  fact,  the  color  coordinate  conversion 
can  be  considered  as  the  third  dimension  of  a  separable  three  dimensional 
data  transformation. 

Next,  each  transform  domain  plane  is  quantized  and  coded.  The  quanti¬ 
zation  operation  is  a  nonlinear  process  that  replaces  each  transform  domain 

/\  /s  /\ 

sample  by  its  best  estimate  FT2,  F^3.  Then,  after  reception  over  the 
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Table  1 

Unitary  Data  Transforms 


Fourier:  Discrete  sinusoidal  basis  vectors 


where  N  =  data  vector  length 

x  =  data  vector  element  index 

:u  -  transformed  data  vector  element  index 

Computation  requires  about  N  iog^  N  complex  multi¬ 
ply  and  add  operations. 


Hadamard:  Sequency  ordered  orthogonal  rectangular  waveform 
basis  vectors 

a  =  hnp  a_1=a 

where  P  =  row  permutation  index 


—  N/2  “N/2 
“N/2  "“N/2. 


—2  V? 
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Computation  requires  about  N  log^  N  add  operations 


Note:  Many  other  Hadamard  matrices  of  dimension  N  mod  4 
are  known  to  exist. 


Karhunen-Loeve;  Basis  vectors  are  eigenvectors  of  data 
covariance  matrix. 

-1  T 

A  =  K  A  =  K 

where  K  C{  K  1  =  diag  |  A  (i)  | 

C^  =  covariance  matrix  of  data  vector 
K  -  eigenvectors  of  C, 

i. 

A  (i)  =  eigenvalues  of 
2 

Computation  requires  N  multiply  and  add  operations 

Note:  If  C^.  is  a  Marxov  process  covariance  matrix  ,i.e.  C  =(/^Xi  Xj  ^ 
wliere  P  is  the  adjacent  clement  correlation  factor ;  the  eigen¬ 
values  and  eigenvectors  can  be  found  inclusively 
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channel,  assuming  no  channel  errors,  inverse  unitary  transforms  are 
taken  to  reconstruct  the  coior  signals 
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ArVl  ai"1 

- y - X 

A2~1f|  A2T1 
AB^Fj  A3 


(3) 


Finally,  tae  inverse  color  coordinate  conversion  defined  by  [n  3 

ij" 

is  performed  to  give 


=  Cm..]  1 
U 


^  /» 

R.  =  Rll~  4  n!2— +  nl3^3 

®  =  n21—  +  n2  £.  +  n23—  (4> 

$  =  +  r.33/3 


Again,  the  o r d e r •  of  the  inverse  color  coordinate  conversion  and  the  inverse 
spatial  transformation  is  unimportant. 

The  design  procedure  for  the  color  image  transiorm  coding  system  of 
Figure  1  consists  of.  (a),  the  selection  ol  the  color  ;oordinate  conversion 
matrix;  (b),  the  choice  of  the  unitary  transform  for  each  color  signal  plane; 

an^  (c),  the  specification  of  the-  quantization  law  for  transform  domain  samples. 

A  considerable  effort  has  been  undertaken  to  analyze  the  expected  perfor- 
of  various  color  coordinate  conversions  and  spatial  transformations  for  color 
image  transform  coding  £l0,  11]  .  The  results  are  summarized  below; 

A.  The  bv_st  color  coordinate  conversion  from  the  standpoint  ox  color 
plane  energy  compaction  is  the  Karhuncn-Loeve  color  system  which 
requires  knowledge  of  the  correlation  between  the(R,  G,  B)color 
planes, 
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B*  The|Y,  I,  Q)color  coordinate  system,  which  is  the  United  States 
standard  for  color  television  transmission,  provides  almost  as 
much  energy  compaction  as  the  Karhunen-Loeve  color  coordinate 
system. 

C.  The  Karhunen-Loeve  spatial  transorm  provides  the  best  energy 
compaction  within  a  color  plane.  However,  the  Fourier  and 
Hadamard  transforms  perform  almost  as  well. 

A  series  of  experiments  has  been  performed  to  subjectively  determine 
the  performance  of  the  color  image  transform  coding  system  using:  a)  various 
linear  and  nonlinear  color  coordinate  systems;  b)  the  Fourier,  Hadamard,  and 
Karhunen-Loeve  spatial  transforms;  c)  limitations  in  data  block  size;  and, 
d)  different  quantization  and  coding  techniques. 

Summarizing,  the  experimental  result;  'ndirate  that: 

A)  It  is  preferable  to  transform  code  the  (Y,I,Q)  color  coordinates  rather  than 
the  (R,  G,  B)  coordinates  of  an  image. 

B)  Substantial  low  pass  filtering  can  be  performed  on  the  chrominance  signal 
of  a  color  image  without  noticeable  effect  on  resolution.  There  is  some 
color  desaturation  caused  by  the  low  pass  filtering,  but  a  ’inear  masking 
operation  can  be  employed  to  correct  for  color  shifts.  For  both  Fourier  and 
Hadamard  transform  coding  of  color  images,  three  bits  per  element  were 
assigned  to  the  luminance  component,  Y,  and  0.375  bits  per  element  were 
allotted  to  the  T  and  Q  chrominance  components  each.  Thus,  the  color 
image  was  coded  with  an  average  of  3.75  bits  per  element  compared  to  the 
;  original  which  required  18  bits  per  element. 

C)  With  a  hybrid  coding  scheme  using  a  16x  16  element  Karhunen-Loeve 

transform  for  the  luminance  signal  and  16  x  1C  Hadamard  transforms  for  the 
(1,0)  signals,  a  color  image  can  be  coded  with  only  1.75  bits  per  element. 
Preliminary  studies  indicate  that  such  a  system  could  be  implemented  for 
operation  approaching  real  time  rates. 
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ft.  Image  En  hancement 

One  of  the  basic  problems  of  image  enhancement  is  to  diminish  the  effect 
of  image  noise  arising  from  the  optical  or  electronic  sensor  that  generated  the 
image.  If  the  image  noise  can  be  statistically  characterized,  then  it  is  possible 
to  apply  Wiener  filtering  techniques  of  signal  estimation  for  image  enhancement. 

Wiener  filtering  is  a  classical  technique  of  signal  estimation  that  has  been 
applied,  primarily,  to  one  dimensional,  continuous  signals,  with  analysis  and 
implementation  based  upon  continuous  Fourier  signal  theory  [l].  It  is  possible, 
of  course,  to  perform  Wiener  filtering  operations  on  time  sampled  signals,  and 
extend  the  technique  to  two  dimensions.  Furthermore,  the  filtering  operation 
can  be  implemented  by  any  unitary  transformation,  rather  than  the  Fourier  transform 
[2-4^.  Finally,  it  is  possible  to  significantly  reduce  the  computational  require¬ 
ments  without  severely  affecting  performance  by  a  technique  of  selective  compi¬ 
lation. 


One  Dimensional  Filtering.  -  Figure  1  is  a  block  diagram  of  a  general¬ 
ized  one  dimensional  Wiener  filtering  system.  A  zero  mean,  M  element, 
data  column  vector,  f,  composed  of  additive  and  uncorrelated  zero  mean 
signal,  s,  and  noise,  n,  components  is  the  input  to  the  system.  The 


FIGURE  I .  Generalized  Wiener  Filtering 

A  unitary  transform  operation  is  performed  on  the  input  yielding 

F  =  Af  =  As+AnaS+N  .  (1) 

Table  1  defines  several  unitary  transforms  that  have  been  used  for  digi¬ 
tal  signal  processing  applications.  Next,  the  transformed  input  vector 
is  multiplied  by  the  filter  matrix,  G,  and  an  inverse  unitary  transform 
operation  is  performed.  The  resultant 

s  =  A-1GF  =  A_1GAf  (2) 
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j  is  considered  the  estimate  of  the  signal,  with  the  filter  matrix,.  G,  chosen  so 
i  that  the  mean  square  error  between  the  signal  and  its  estimate  is  minimized, 

i . 

•  Statistical  I?  representation. -For  many  classes  of  data,  a  data  vector  f  can  be 
considered  a  sample  of  a  random  process  of  known  mean  i  and  with  a  covariance 
;  matrix 


— »  v  v  T 

c,  =  (f-f)(f  -f  ) 


(3) 


.  The  covariance  matrix  of  a  unitary  transform  of  data  vector,  F  =  Af, 
likewise  is  given  by 


which  reduces  to  [5} 


CF  =  (F-F)(F*-F*)T 


C„‘  =  ACA"7  =  AC^"1 


f 


(4) 


(5) 


.  There  is  no  general  closed  form  solution  for  eq.  (5)  except  for  special  cases. 
However,  it  should  be  recognized  that  eq.  (5)  is  simply  a  separable  two 
dimensional  unitary  transform  of  C^. 

Generalized  Wic-nor  Filtering,  -In  the  design  of  the  generalized  Wiener 
filtering  system  of  Figure  1  a  filter  matrix,  G,  is  chosen  to  minimize  the  mean 
square  error 


e  =  T 


(s-s)(s-s)T 


(6) 


between  the  signal,  s,  and  its  estimate,  s.  The  expression  for  the  m.  s.e.  can 
be  rewritten  in  terms  of  transform  domain  quantities  as 


e  =  Tr  (A"*S-A’’^G{S+N)XA  "*S-A  -1G  (S+N))1 


(7) 


Making  use  of  the  fact  that 


r£*  >»<T 

SS  =  C  =  AC  A 
•  S  s 


nna=  c  -  lc  a"’1’ 

N  n 

T  t" 

SN  =  NS  -  0 


(8) 
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PH^J 
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'rY?,l\'f-  *!  h*  :?t 


typing  . 


one  obtains 


i 


e  =  Tr 


CS-2GCS+GG(Cs'lCN)j 


(9) 


Now,  a  straightforward  minimization  of  eq.  (9)  yields  the  optimum  filter  matrix 

.-1 


=  C0(Cc+C  T) 


o 


N' 


(10) 


Thus,  the  optimum  filler  matrix  may  be  determined  from  the  transform  spectral 
densities  of  the  signal  and  noise  vectors.  Alternatively,  the  optimum  filter  can 
be  found  by  a  two  dimensional  transformation 


Go  =  A  6oa 


*T 


of  a  matrix 


g  =  C  (C  -iC  ) 
O  s  s  n 


-1 


(ID 


(12) 


based  upon  the  covariance  matrices  of  the  signal  and  noise.  The  matrix  g 
will  be  called  the  response  matrix,  in  analogy  with  the  impulse  response 
of  a  linear  system. 


O 


Substitution  of  the  optimum  filter  matrix  into  eq,  (9)  gives  the  minimum 
m.  s.  e. 


e  .  =  Tr 

min 


CSCN(CS+CN» 


-1 


Alternatively,  the  m.  s.  e.  can  be  written  in  Terms  of  thc\  data'  domain 
covaria  oe  matrices  as 


(13) 


mm 


-  Tr 


C  C  (C  +C  ) 
s  n  s  n 


-1 


(14) 


Note  that  from  eq.  (14),  the  minimum  m.  s.  e.  is  independent  of  the  type  of 
unitary  transform  employe^.  However,  '.he  character  of  the  filter  matrix,  G, 
is  dependent  upon  the  unitary  transform.  The  conclusion  is  that  one  is  free 
to  choose  the  transform  that  will  minimize  the  computational  processes  entailed 
in  filter  generation  and  filter  operation. 

Table  2  lists  the  number  of  multip  ications  required  to  perform  th  T icner 
filtering  operations  of  eq,  (2).  For  long  vector  lengths  the  multiplicati  "i 
requirements  of  the  Fourier  and  Hadamard  transforms  approach  those  of  the 
identity  operator.  The  Karhunen-Loeve  transform  requires  more  than  twice 
as  many  multiplications  as  the  identity  operator.  The  computational  require¬ 
ments  for  filter  generation  are  considered  in  the  next  section. 
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Table  2 

Optimal  Wiener  Filtering  Computation  Requirements 

Transform 

Approximate  number  of 
multiplication /addition 
operations 

Identity 

M2 

Karhunen -  Loe ve 

2M2*  M 

Fourier  /  Hadama  rd 

M2+2M  log  2M 

M  =  2m  wk^e  m  is  an  integer 


Suboptimal  Wiener  Filtering.  -When  considering  only  the  amount  of 
computation  required  to  perform  the  Wiener  filtering  operation,  as  opposed  to 
the  computation  requirements  for  the  filter  generation,  it  is  obvious  that  the 
bulk  of  the  computation  for  the  Fourier,  Iladamard,  and  identity  transform 
systems  can  be  attributed  to  the  filter  matrix  multiplication  operation.  The 
.  Karhunen-Loeve  transform  system  only  requires  a  scalar  filter  multiplication, 
but  entails  two  matrix  multiplication  operations  for  the  transformations.  As  a 
compromise,  consideration  has  been  given  to  the  development  of  filter  matrices 
that  contain  a  relatively  large  number  of  zero  entries.  With  such  matrices  the 
filter  operation  could  be  performed  with  a  reduced  number  of  computations.  The 
goal  in  the  filter  design,  of  course,  is  to  maintain  the  m.  s.  e.  performance  as 
close  to  the  optimum  filter  level  as  possible. 

The  suboptimal  Wiener  filtering  design  problem  can  be  formulated  as  an 
optimization  problem  whereby  the  filter  matrix,  G,  is  chosen  to  minimize 
the  rr> .  s .  e , 


e 


Tr 


C  -2GC  +GG(C  +C  ) 
S  S  '  S  N 


(15) 


unde  the  constraint  that  certain  selected  elements  of  G  are  zero.  The 
zero  elements  of  G  are  so  placed  that  fast  computational  algorithms  can 
be  employed  in  the  filter  matrix  multiplication. 

As  a  special  case  of  this  technique  consider  the  filter  design  when  the 
filter  matrix  is  constrained  to  be  a  diagonal  matrix.  The  optimum  filter 
is  found  to  be  [3  ] 

Cg(i»  i) 

Cg(i,  i)-l  CN(i,  ]J~ 


G  =  diag 


(16) 


and  the  resultant  m.  s.  e.  is 


M  cs(i,  ij+cN(i,;> 

Conceivably,  one  could  determine  the  filter  design  when  the  filter  matrix 
contains  two  elements  per  row,  and  continue  with  additional  matrix  ele¬ 
ments  until  the  desired  m.  s.e.  is  achieved.  Unfortunately,  the  optimi¬ 
zation  problem  rapidly  becomes  quite  complex. 

One  alternative  to  the  optimization  is  to  use  selected  elements  of  :he 
optimum  vector  Wiener  filter  matrix  as  the  suboptimal  filter  matrix.  For 
example,  only  the  diagonal  and  near  diagonal  terms  of  the  optimum  filter 
matrix  could  be  retained.  Another  technique  is  to  select  only  those  terms 
of  large  magnitude.  For  a  magnitude  threshold  filter  it  has  been  found  that 
the  identity  filter  performs  quite  poor'  /  compared  to  the  other  transforms  be¬ 
cause  of  its  poor  energy  compaction  properties.  Also,  the  performance  of  the 
Fourier  and  Hadamard  transforms  for  as  few  qj  about  ion  percent  non-zero 
terms  is  quite  close  to  the  optimum  performance. 

Image  Enhancement  Applications: 

The  generalized  Wiener  filtering  technique  has  been  applied  to  the  enhance¬ 
ment  of  images  as  a  test  of  its  validity.  In  this  application  each  image  line  and 
column  has  been  characterized  as  a  Markov  process  with  additive  white  noise. 

The  image  was  divided  into  16  by  16  element  blocks  for  computational  simplicity. 
Each  block  was  filtered  using  an  optimal  Hadamard  domain  Wiener  filter  and  a 
Wiener  filter  with  the'smallest  75%  of  the  terms  removed.  In  both  cases  the  noise 
effects  were  substantially  reduced  compared  to  the  original.  \ ' 
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.-3".  Image  Detection 

The  research  results  fall  into  the  five  major  categories  listed  below: 

a.  Detection  and.  Multidimensional  Rotation 

b.  Detection  and  the  Minimum  Spanning  Tree 

c.  Image  Similarity 

d.  Variance  of  Bayes  Estimates 

e.  Detection  and  Lateral  Inhibition 

The  first  two  topics  can  be  best  described  as  feature  selection  mechanisms  for 
dimensionality  reduction  in  the  image  detection  and  classification  problem. 

..  In  these  cases,  an  II  dimensional  vector  space  approach  is  developed  and 
utilised  to  obtain  significant  discriminatory  features  for  input  to  a  decision 
algorithm  for  detection  purposes .  The  third  approach  is  one  of  developing  a 
measure  of  image  similarity.  Such  a  measure  will  then  allow  a  decision  to  be 
made  based  upon  a  degree  of  similarity  between  images.  This  could  be  considered 
in  analogy  with  the  two  dimensional  matched  filter.  The  fourth  result  includes 
some  statistical  analyses  developing  new  properties  of  the  variance  of  Bayes 
estimates.  While  not  directly  couched  in  the  notation  of  imagery*  the  results, 
nonetheless,  have  bearing  in  detection  problems  utilizing  Bayesian  Estimators. 
The  fifth  topic  is  one  in  which  the  concept  of  lateral  inhibition  is  utilized 
in  an  attempt  to  obtain  a  linear  transformation  which  best  implements  the 
inhibition  process  for  image  detection. 
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In  the  context  of  an  image  detection  and  recognition  environment ,  it  becomes 

> 

immediately  obvious  that  for  machine  implementation,  the  dimensionality  of 
the  problem  is  far  too  large.  Specifically,  if  one  analyses  the  degrees 
of  freedom  which  comprise  an  "SxK  sampled  image,  and  attempts  to  develop 

p 

decision  algorithms  in  the  ir  dimensional  space  defined  by  the  image,  compu¬ 
tational  power  suddenly  becomes  a  premium.  In  order  to  reduce  the  dimensionality 
of  the  problem  to  a  set  of  features  which  retain  detection  significance  while 
removing  those  dimensions  which  provide  little  or  nothing  to  the  decision 

O 

process,  it  is  necessary  to  consider  the  image  as  a  point  in  II  dimensional 
space.  Then  by  appropriate  transformations ,  be  they  linear  or  nonlinear,  it 
will  be  possible  to  retain  a  fewer  set  of  coordinates  or  degrees  of  freedom 
while  maintaining  detection  integrity. 

I 

One  method  of  particular  interest  for  dimensionality  reduction  is  a  linear 
transformation  provided  by  unitary  matrices  and  often  referred  to  as  multi¬ 
dimensional  rotations.  Five  such  rotations  were  investigated  as  to  their 
potential  for  discriminating  significant  features  in  an  image  detection  problem 
involving  the  recognition  of  hand  written  characters.  The  five  rotations  of 
interest  were  the  Karhunen-Loeve,  Fourier,  V.'alsh,  Kaar,  and  Identity.  The 
results  of  the  experiment  suggested  that  large  dimensionality  reduction  factors 
were  indeed  possible  with  little  cr  no  degradation  in  detection  performance 
and  the  order  of  the  cost  efficient  rotations  is  as  listed  above.  The  papers 
at  the  end  of  this  report  present  the  details  and  published  results .on  this 
topic. 
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DETECTION  AND  THE  MINIMUM  SPATEIIEO  TE?I 


The  use  of  multidimensional  rotations  discussed  above  is  restricted  to  linear 


techniques  and  will  not  cover  the  cases  where  dimensionality  reduction  requires 

2 

nonlinear  operations.  As  an  example,  consider  a  class  of  images  in  N  space 
whose  intrinsic  discriminatory  dimensionality  lies  on  a  nonlinear  surface 
undetectable  by  linear  rotations.  Specifically,  data  points  on  a  sphere  are 
intrinsically  two  dimensional  although  distributed  in  3  space.  In  order  to  be 
able  to  handle  such  situations,  (which  are  more  likely  than  linear  intrinsic 
dimensions)  nonlinear  recursive  search  techniques  must  be  utilized.  Research 
in  this  direction  is  currently  underway  where  considerable  emphasis  is  placed 
on  the  concept  of  the  minimum  spanning  tree  ('-1ST).  The  K3T  is  a  unique 
connection  of  images  (points  in  H2  space)  such  that  they  are  linked  with  the 
shortest  total  distance.  The  tree  has  the  potential  for  maintaining  local 
measures  at  the  expense  of  global  properties  and,  thus,  provides  a  clue  to 
measuring  intrinsic 'dimensionality.  As  research  on  this  topic  was  just  initiated 
it  is  too  early  to  present  conclusive  results,  but  the  subject  is  under  current 
investigation  with  indications  of  developing  a  promising  nonlinear  discriminatory 
feature  selector  for  image  detection. 


IKAG3  SIMILARITY 

A  new  measure  of  similarity  between  images  has  been  developed  which  is  proposed 
as  a  useful  detection  criterion.  The  measure  has  information  theoretic  properties 
and,  with  proper  image  normalization,  is  the  entropy  function.  The  measure  is 
convex  upward  reaching  a  peak  when  two  images  are  identical  and  thus  perfect 


image  detection  occurs  at  the  maximum  of  this  image  similarity  measure.  The 
results  of  this  research  are  well  documented  in  the  papers  at  the  end  of  this 
report.  However,  ir.  addition  to  providing  a  similarity  measure,  techniques  for 


dealing  with,  magnificat ions  and  rotations  are  also  discussed  in  that  document 
and  provide  a  type  of  preno realization  as  input  to  the  similarity  measure . 

VARIANCE  0?  BAYES  ESTIMATES 

\ 

This  portion  of  the  image  detection  research  concerned  itself  with  developing 
tighter  hounds  on  the  estimates  obtained  from  Bayesian  techniques.  Specif ically, 
exponential  hounds  are  obtained  and  provide  confidence  measures  as  more  and 
nore  samples  are  taken.  The  results  of  the  research  are  far  more  widespread 
than  just  applying  to  image  detection.  In  fact,  any  Bayesian  estimator  falls 
within  the  bounds  obtained  in  this  research.  The  paper  describing  this  work 
is  included  in  the  papers  section  and  goes  into  considerable  detail  on  the 
subject. 

DSTECTI Cl  AND  LATERAL  INHIBITION 

The  contents  of  the  research  on  this  subject  examines  neural  net  properties, 
in  particular  lateral  inhibition  as  a  possible  mechanism  for  extracting  features 
important  to  image  detection.  Spatial  filters  have  been  investigated  vhich 
emphasize  edges  in  a  pattern.  Biologically,  this  spatial  process  is  called 
lateral  inhibition  in  that  a  neuron  inhibits  surrounding  neurons  directly 
proportional  to  its  firing  frequency  and  inversely  proportional  to  distance. 
Lateral  inhibition  is  a  spatial  filter  that  emphasizes  edges  or  boundaries 
between  contrasting  light  levels  and  may  play  a  part  in  identifying  objects. 
Linear  transformations  were  developed  mathematically  to  model  such  processes 
and  extensive  biological  and  engineering  literature  searches  were  undertaken  to 
solidify  the  model.  There  is  still  controversy  over  whether  a  visual  pattern 
is  recognized  as  a  whole  or  whether  it  is  perceived  through  the  combination  of 
several  elements  or  features.  Just  how  meaningful  features  are  extracted 
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from,  incoming  sensory  information  is  not  krona,  except  for  lateral  inhibition 
which  enhances  edges  or  b,  andarios. 


4 

/{.  Linear  Image  Transformation 


Linear  transformations  of  picture  segments  play  a  role  in  bandwidth 
compression,  image  enhancement  and  pattern  recognition.  The  two  dimen¬ 
sional  structure  of  the  data  leads  to  rather  large  linear  spaces  and  makes 

computation  of  a  transforri  via  its  describing  matrix  impractical.  A  square 

2 

picture  segment,  whose  linear  dimension  is  N,  has  N  values  and  the 

4 

computation  of  a  linear  transformation  by  matrix  multiplication  requires  N 
multipliers.-;  Clearly  faster  algorithms  are  needed. 

The  Fourier  transform  and  Hadamard  transform  each  possess  algorithms 
which  require  M  ln^  Ivi  operations  on  M  points.  However,  these  are  only 
two  of  a  large  class,  called  group  character  transforms,  which  have  fast 
algorithms . 

The  above  algorithms  are  most  efficient  when  M  is  a  power  of  2. 

This  class  can  be  further  generalized  as  follows:  Let  L^  be  the  matrix  of 
a  linear  transform  in  N  dimensional  space  which  has  a  fast  algorithm.  If 
A  is  a  2N  by  2N  matrix  with  at  most  \  non-zero  entries  in  each  row  then 


has  a  fast  algorithm:  If  Vj,  are  of  dimension  N  and  is  the  number 
of  additions  required  to  compute  L^V.  ther 


requires  2W^j+2(X-l)N  additions.  Repetitive  use  of  this  structure  yields 
algorithms  whose  computation  time  grow  as  N  ln^N.  Such  constructions 
can  be  used  to  design  ti  .nsforms  with  fast  algorithms  and  some  additional 
property.  For  instance,  the  following  modification  of  the  Hadamard  con- 
!  struction  produces  a  matrix  with  the  linear  function  {ah+’o  :  h=l,  .  .  .  ,N} 


as  one  of  its  rows. 


ai  =  1 


and  define  recursively  for  N  a  power  of  2  with  1^  an  N  by  N  identity  matrix 


__  _ IN  ,  ^ 

l2N"  /,  2  ’  2N  r  2 

,  V1+4aN  v  +4&N 


/  *N 


"a2NIN  b  I  a,  I  ,  _ 

2N  N  2N  N  b  I 
2N 


{Wn  ■a2NIN  'b2N:N  'a2MIf. 


L2N  0 


The  2N  row  of  L,.„,  is  the  linear  function.  The  matrix  also  has  the 
4N 

property  th L  is  an  orthogonal  matrix. 

Whether  or  not  this  class  of  transforms  is  useful  for  picture  analysis 


has  not  yet  been  studied. 
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The  Karhunen-Loeve  transform  plays  an  important  role  in  picture 

analysis  but  does  not  have  a  fast  algorithm.  This  transform  is  defined  as 

follows.  Let  V  be  an  N-dimensional  random  vector  and  compute 

K..  =  E  fv.  v.  },  where  E{  }  denotes  ensemble  average  (expected  value), 
ij'  i  J 

Let  the  rows  of  L^  be  the  unit  eigenvectors  of  R  in  order  of  decreasing 

values  of  corresponding  eigenvalue.  This  transform  has  the  property  that 

•  « 

most  of  the  "energy"  of  the  random  vector  is  in  the  first  few  coordinates. 
When  the  vector  is  formed  by  scanning  the  elements  of  a  picture  in  linear 
order,  the  eigenvectors  tend  to  be  sines  and  cosines  of  increasing  frequency. 
The  frequencies  are  not  integer  multiple  of  a  fundamental,  so  the  fast 
Fourier  transform  does  not  apply.  However,  with  the  exception  of  the  first 
few  terms,  cosines  and  sines  alternate  as  eigenvectors  and  the  frequencies 
differ. by  one  half  the  fundamental.  Thus,  the  transform  can  be  approximated 
by  two  FFT's  with  some  extra  work  involving  projection  of  the  vector  onto 
the  first  few  rows  of  the  matrix.  A  fast  algorithm  for  the  inverse  trans¬ 


form  has  not  yet  been  found. 


Transform  Implementation 


The  Hadamard  transform,  requiring  no  multiplications,  seem  to 
be  the  simplest  for  which  to  design  a  hardware  implementation.  Let 
(U  , . . . ,  U  )  be  the  Hadamard  transform  of  (V  ,  V  \  Then 

On  n  1  /* 


where 


U.  =  2(-l)(l,;i)v. 

1  j  3 


i  =  i  +2i  +. .  .  2n  *i 

12  n 


j  2n_1jn 


(i.j)  =  Si„jn 


The  above  equation  can  be  viitten  to  exhibit  the  structure  of  its  fast 


algorithm. 


i.  =  . 

i  j_“.  i 


(.l),"'n^  (.||W»-2  y 


The  recursion  described  by 


So(j|»  •  •  • ,  j  )  =  V. 

J 


sk(ir  •  •  •  ’  V  jk+r  •  *  •  ’  jn)  =  sk-  i(ir  *  *  •  •  V  i*  °» h+i*  •  •  •  *  jn} 


K 

+  (-1)  S  ,(i, ,  . . . ,  j.  ,  l,i  •  \ 

k-1'  1’  k-  1,  *Jk4  l »  •  •  • »  JrJ 
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These  equations  then  lead  directly  to  the  logical  diagram  of  Figure  1. 

When  two  consectutive  stages  are  connected,  the  final  delay  of  the 

£h  s  t 

k  stage  may  be  incorporated  into  fhe  initial  delay  of  the  (k+1)  stage. 

Therefore,  a  transform  on  2‘  points  requires  2  +2  units  of  delay, 

n  adders  and  the  associated  gating  circuits.  It  is  estimated  a  six./four 

point  transfov.,  with  eight-bit  input  samples  and  twelve-bit  output  samples, 

using  medium  scale  integrated  circuits  at  five  megahertz  would  require 

less  than  $1500  in  components. 
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